A Hierarchical Approach for Clusters in Different Densities
نویسندگان
چکیده
Clustering has the following challenges: 1) clusters with arbitrary shapes; 2) minimal domain knowledge to determine the input parameters; 3) scalability for large data sets. Density-based clustering has been recognized as a powerful approach for discovering clusters with arbitrary shapes. However, the other two challenges still remain in most existing clustering algorithms. In this paper, we explore a hierarchical and iterative densitybased clustering method for large data sets with clusters in different densities. We meet the second challenge by reducing input parameters and solve the third challenge by means of hashing techniques and a vertical data structure, P-tree1 . Our experiments with three different data sets show that our approach is more efficient and robust than DBSCAN, TURN*, and K-means with better clustering qualities.
منابع مشابه
A Clustering Based Location-allocation Problem Considering Transportation Costs and Statistical Properties (RESEARCH NOTE)
Cluster analysis is a useful technique in multivariate statistical analysis. Different types of hierarchical cluster analysis and K-means have been used for data analysis in previous studies. However, the K-means algorithm can be improved using some metaheuristics algorithms. In this study, we propose simulated annealing based algorithm for K-means in the clustering analysis which we refer it a...
متن کاملGraph Clustering by Hierarchical Singular Value Decomposition with Selectable Range for Number of Clusters Members
Graphs have so many applications in real world problems. When we deal with huge volume of data, analyzing data is difficult or sometimes impossible. In big data problems, clustering data is a useful tool for data analysis. Singular value decomposition(SVD) is one of the best algorithms for clustering graph but we do not have any choice to select the number of clusters and the number of members ...
متن کاملThe functional response of Aphidius ervi (Haliday)(Hym.: Braconidae, Aphidiinae) to different densities of Sitobion avenae (Fabricius)(Hom.: Aphididae) on two wheat cultivars
The functional response of Aphidius ervi to different Sitobion avenae densities on two wheat cultivars (Sardary and Alvand) was examined in laboratory conditions. Experiments were carried out in test tubes on an F2 lab generation without wheat clusters and also on F2 and F5 generations in pots using wheat clusters. In the tubes, female wasps were exposed to aphid densities of 2, 4, 7, 14, 28, 3...
متن کاملImprovement of density-based clustering algorithm using modifying the density definitions and input parameter
Clustering is one of the main tasks in data mining, which means grouping similar samples. In general, there is a wide variety of clustering algorithms. One of these categories is density-based clustering. Various algorithms have been proposed for this method; one of the most widely used algorithms called DBSCAN. DBSCAN can identify clusters of different shapes in the dataset and automatically i...
متن کاملLocal Density-based Hierarchical Clustering for Overlapping Distribution using Minimum Spanning Tree
In this paper, we propose a clustering algorithm to find clusters of different sizes, shapes and densities. Density and Hierarchical based approaches are adopted in the algorithm using Minimum Spanning Tree, resulting in a new algorithm – Local Density-based Hierarchical Clustering Algorithm for overlapping data distribution using Minimum Spanning Tree (LDHCODMST). The algorithm is divided into...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006